Apparatus and method for generating virtual reality content

ABSTRACT

A method for generating virtual reality (VR) content includes obtaining source information from original media content. retrieving characteristic information stored in a first memory based on the obtained source information wherein the retrieved characteristic information related to the original media content, generating the VR content by combining the retrieved characteristics information and the obtained source information, wherein at least one feature of the retrieved characteristic information is mapped to a corresponding at least one feature of the obtained source information, and outputting the generated VR content and the original media content to be reproduced to a viewer.

Virtual Reality (VR) content provides an enhanced viewing experience fora user by conveying the feeling of being immersed within an environment.Most VR systems enhance the experience by providing the user with theability to change the user's viewpoint within the content or withrespect to various elements of the content, and the environment,characters, and objects within the content may be presented to have athree-dimensional (3D) effect.

However, the creation of VR content typically involves advance planningand complex procedures during the content creation process in order tocapture all of the information required to implement a VR experience ofthe content.

For example, some VR content creation procedures include scanning theentire 360 degree environment of a scene to capture imagery and audiosimultaneously to store sufficient information to present a VRexperience from various and multiple angles for a user. Even in mostnon-VR implementations, the creation of 3D content from a single angletypically requires that the content be created with the intention ofimplementing the content in 3D, such as by using a dual lens 3Ddedicated camera, in order to capture all required information duringthe content creation process in order to achieve the best results.

Therefore, the ability for content creators and providers to dynamicallycraft and create VR content from existing two-dimensional (2D) and 3Dcontent without the benefit of preplanned and intentionally captured VRinformation during creation of the original content may be advantageous.Thus, there is a great need for a solution to effectively generate VRinformation from existing content.

SUMMARY

Embodiments of the present disclosure include a system and method forgenerating VR content based on original non-VR content, where particularfeatures of the original content are identified and used to retrievestored information to generate the VR content. In some embodiments, theoriginal content is 2D or 3D content, and the VR content is generatedbased on the original content and additional stored information relatedto the original content. In some embodiments, the additional storedinformation is 2D content or 3D content related to the identifiedparticular features of the original content. In some embodiments, thestored information is retrieved based on the identified particularfeatures.

In some embodiments, the identified particular features of the originalcontent include audio, video, or text of the original content.Embodiments of the present disclosure includes wherein the storedinformation includes multiple images of an identified particular featureof the original content, and the multiple images are used to generate VRcontent of the identified particular feature.

According to an embodiment of the present disclosure, stored informationrelated to a recognized object within the original content is searchedand retrieved. In this embodiment, the retrieved stored informationincludes 2D images of the recognized object or may include 3D content ofthe recognized object. The stored information may be associated with animage related to the original content, audio of the original content, ortext of the original content. The embodiment may further includegenerating VR content based on the retrieved stored information, and mayfurther include storing the generated VR content to be associated withaudio, video, or text of the recognized object.

An embodiment of a method of the present disclosure includes obtainingsource information from original content, retrieving characteristicinformation stored in a memory, the retrieved characteristic informationrelated to the original content based on the obtained sourceinformation, and generating VR content based on the obtained sourceinformation and the retrieved characteristic information.

An embodiment of a system of the present disclosure includes at leastone controller configured to obtain source information from originalcontent, retrieve characteristic information stored in a memory, theretrieved characteristic information related to the original contentbased on the obtained source information, and generate VR content basedon the obtained source information and the retrieved characteristicinformation. In some embodiments, the original content may include 2Dcontent or 3D content.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other aspects and features of the present disclosure willbecome more apparent upon consideration of the following description ofembodiments, taken in conjunction with the accompanying drawing figures.

FIG. 1 is an illustration of a system for generating VR content inaccordance with an embodiment of the present disclosure.

FIG. 2 is an illustration of a content analyzer of the system inaccordance with an embodiment of the present disclosure.

FIG. 3 is an illustration of a method for generating VR content inaccordance with an embodiment of the present disclosure.

FIG. 4 is an illustration of an example of original source content,additional stored information related to the original content, andgenerated VR content in accordance with an embodiment of the presentdisclosure.

DETAILED DESCRIPTION

In the following detailed description, reference is made to theaccompanying drawing figures which form a part hereof, and which show byway of illustration specific embodiments of the present invention. It isto be understood by those of ordinary skill in this technological fieldthat other embodiments may be utilized, and that structural, electrical,as well as procedural changes may be made without departing from thescope of the present invention. Wherever possible, the same referencenumbers will be used throughout the drawings to refer to the same orsimilar parts.

Embodiments of the present invention relate to a system and method forgenerating VR content based on original non-VR content, usingpreexisting information related to identified features of the originalcontent.

Referring now to FIG. 1, an embodiment of a system 100 for generating VRcontent is depicted. The system 100 includes a VR generator 103 forreceiving original content from an original content database 101. Theoriginal content may include non-VR content such as traditional 2Dcontent, and may also include 3D content which is not implemented as VRcontent. For example, the original content may include, withoutlimitation, television broadcasts, sporting events, 2D movies, 2Dmultiview content, 3D movies, online video content, and the like. Theembodiment of FIG. 1 depicts the source of the original contentrepresented by a file based database system, however it will beunderstood by those of ordinary skill in the art that various othercontent sources may be implemented in the system 100, including non-filebased sources such as live television broadcasts, streaming internetcontent, fixed format content sources, and the like.

The VR generator 103 may include a content analyzer 103 configured toprocess the original content received from the original content database101. The content analyzer 103 may be configured to analyze variousfeatures of the original content, including object and scenerecognition, face detection and recognition, audio and voicerecognition, audio and voice detection, voice-to-text analysis,multi-channel audio analysis, graphic text analysis, and the like. Thecontent analyzer 103 may perform the various analysis procedures,discussed in further detail below, and identify various features of theoriginal content, including people, objects, environments, environmentconditions, entire scenes, and the like.

Based on features of the original content identified by the contentanalyzer, 103, additional information related to the identified featuresmay be retrieved from a characteristics database 104. Thecharacteristics database 104 may store various information related toidentified people, objects, environments, environment conditions, andthe like, and the characteristics database information may be stored tobe associated with various metadata, wherein the metadata may correspondto audio, video, images, or text identified in the original content bythe content analyzer 103. The stored information may include relatedvideo or images of the identified features which may be used to generatea 3D or VR content of the identified feature.

For example, the content analyzer 103 may detect a face of an actordepicted in the original content and perform facial recognition toidentify a certain actor. The content analyzer 103 may query thecharacteristics database 104 with the name of the actor, or othermetadata generated by based on the identified actor, to requestadditional stored information from the characteristics database. Thequery may include the actor's name, a unique identifier associated withthe actor, or other metadata otherwise identifying the actor identifiedin the scene.

The content analyzer may retrieve additional stored content associatedwith the identified actor from the characteristics database 104,including video and images of the identified actor from various sides,angles, lighting, distances, and the like. The additional stored contentmay include any additional content which may contribute to thegeneration of VR content based on the original content, such as audioand video related to features of the original content. This includesadditional content for the generation of VR content which is notincluded in the original content, such as information required togenerate a complete 360 degree view to a viewer.

In an example shown in FIG. 4, the original content 400 may be atraditional 2D video movie which includes a scene depicting the Statueof Liberty 401. The content analyzer 103 may recognize and identify thelandmark depicted in the scene as the Statue of Liberty and generate orobtain metadata associated with the scene of the original content 400.Based on the identified landmark 402 and the obtained metadata, thecontent analyzer 103 may query the characteristics database 104 toretrieve additional stored information 403 related to the Statue ofLiberty to generate VR content.

The additional stored information may include images of the Statue ofLiberty from various angles 404, 405, 406, 407, 408, and may alsoinclude a 3D video file (not shown) depicting the Statue of Liberty inthree dimensions which was created for the generation of anotherunrelated media content, and stored and tagged with “Statue of Liberty”related metadata in the characteristics database 104. Based on theadditional stored information 403 obtained from the characteristicsdatabase 104, VR content may be generated which includes a VRrepresentation of the Statue of Liberty 409 based on the informationprovided by the various angle views and the 3D content of the landmarkobtained from the characteristics database 104.In yet another example,and referring back to FIG. 1, the content analyzer 103 may detect andrecognize audio of a helicopter in the scene of the original content,however the video of the scene may not include a depiction of thehelicopter. Based on the audio of the original content, the contentanalyzer 103 may identify the presence of a helicopter and query thecharacteristic database 104 for additional stored information related tohelicopters. The additional stored information may then be used, asdescribed further below, to generate VR content in which a depiction ofthe helicopter is visible when a viewpoint of the user is changed toface the direction of the source of the helicopter audio.

In another embodiment, the characteristics database 104 may includeaudio information associated with stored metadata. In such embodiment,the content analyzer 103 may query the characteristics database 104using the helicopter audio from the original content, by using anacoustic fingerprint or other representation of the audio. Thecharacteristics database 104 may provide additional stored informationassociated with the audio if the query matches stored audio or storedacoustic fingerprints. For example, the additional stored informationmay include the model of the helicopter identifiable by a distinct rotorsound, and may further include images, 2D video, or 3D video of theparticular model of the helicopter to be used for generating the VRcontent based on the original content.

In other embodiments, the content analyzer 103 may be configured toanalyze and recognize various features of the audio or video of theoriginal content, such that the content analyzer 103 may performrecognition processes using other information sources prior to queryingthe characteristics database 104.

In one embodiment, the system 100 may further include a VR featuresdatabase 105 which stores previously created VR content. The previouslycreated VR content may be associated with identifying metadata to enablethe system 100 to search for and retrieve the previously created VRcontent. In the previously discussed example involving the Statue ofLiberty, the VR features database 105 may include VR content informationdepicting the Statute of Liberty which was created as VR content duringthe content creation process, including capturing all environments andobjects from numerous viewpoints. Thus, the content analyzer 103 mayquery the VR features database 105, in addition to the characteristicsdatabase 104, to retrieve previously created VR content related to theStatue of Liberty, including individually captured video frame images,and the like.

In another example, the retrieved VR content information from the VRfeatures database 105 may include VR content which was created by the VRcontent generating system 100 based on original non-VR content, whichwas stored in the VR features database 105 after being generated.Referring back to the example of FIG. 4, after the generated VR content409 including the VR representation of the Statute of Liberty isgenerated, the VR content 409 may be stored in the VR features database105. The VR content 409 may be stored to be associated with identifyingmetadata associated with the original content, the scene of the originalcontent, as well as metadata associated with the landmark. Thus, thestored generated VR content 409 may be retrieved for use in latergeneration of VR content in connection with other and unrelated originalcontent.

In other embodiments, the system may include a VR objects database and aVR scene database (discussed further with reference to FIG. 2). In suchembodiments, the VR objects database may be configured to store VRcontent information related to specific and identified objects which areextracted and may be incorporated into the generation of new VR content.Referring briefly to the example of FIG. 4, information of the VRrepresentation of only the Statue of Liberty extracted from thegenerated VR content 409 may be stored in the VR objects database.

Additionally, the VR scene database may be configured to store VRcontent information related to complete scenes which may be used andincorporated into the generation of new VR content. Examples of thisinclude VR scenes which may be implemented for various original content,such as a VR scene of a beach, a VR scene of a waterfall, a VR scene ofwildlife, and the like. Referring briefly to the example of FIG. 4,information of the entire VR scene corresponding to the generated VRcontent 409 may be stored in the VR scene database.

The VR generator 103 may further include a VR synthesizer 106 whichreceives the analyzed original content, additional information relatedto the original content retrieved from the characteristics database 104,and additional VR information retrieved from the VR features database105. The VR synthesizer 106 may be configured to map the additionalinformation to the original content in order to generate VR content 107based on the original content.

In one example, the VR synthesizer 106 may be configured to generate a3D surface map of a face or an object included in the original content,and utilize the additional information related to the face or the objectretrieved from the characteristics database or the VR features databaseto generate a full 3D representation of the face or object forgeneration of the VR content. Other known 3D mapping techniques areconsidered for generating the VR content based on the original contentand the additional stored information, and one of ordinary skill in theart will appreciate that various techniques and methods known and usedin the related art may be applied in this aspect of the presentdisclosure.

The VR synthesizer may further incorporate audio information receivedfrom the content analyzer 103, including multi-channel audio todetermine a position of an object or character within the generated VRspace if the object or character is not depicted in the originalcontent. The audio information may also be used to determine movement ofan object or character within the VR space, for example where the VRcontent is generated to include a helicopter flying from left to rightas determined from the directional audio information.

After being generated, the VR content may be provided to another partyfor distribution or display to a viewer. The VR content may be providedalong with the original source content to enable a viewer to selectwhich format is desired for consuming the content. Various embodimentsare considered where the viewer is able to switch between a traditional2D or 3D format of the content or the generated VR content experience.

The generated VR content may also be stored in the VR features database105 for use in other VR content generation processes. In someembodiments, separate VR information may be stored in the VR objectsdatabase and the VR scene database for use in other VR contentgeneration processes. The stored VR information may be associated withvarious metadata related to the content of the generated VR informationincluding all objects, persons, landmarks, scenes, environmentconditions, and the like, within the content, as well as the originalcontent corresponding to the generated VR content.

The VR content may also be generated to include all or some of themetadata identified, generated, or collected during the generation ofthe VR content. For example, if a particular actor is identified in theoriginal content and the VR content includes a VR representation of theparticular actor, the VR content may include metadata of the actorwithin the generated content information.

In yet other embodiments, various objects, persons, landmarks, and thelike within the generated VR content may be actionable by viewer toactivate and view the associated metadata. In such embodiments, a VRrepresentation of the particular identified actor may be clickable orotherwise selectable within the VR space in order to view the metadataassociated with the particular actor. The VR content may also begenerated to include feedback instructions to be performed, such as byproviding tactile or haptic feedback, and the like, in response to anaction or interaction with a user. In other embodiments, the associatedmetadata may not be directly displayed to a user, but the metadata maybe used to connect to external information sources, such as storeddatabases or the internet, to search for and retrieve informationrelated to the particular actor based on the stored metadata in thegenerated VR content.

In some embodiments, the VR generator 103 may comprise one or moresoftware or hardware computer systems. In an embodiment the VR generatormay comprise or be operably coupled to one or more hardware memorysystems for storing information including databases for storing,accessing, and querying various content and characteristics information.The one or more computer systems of the VR generator 103 may include oneor more computer processors or controllers configured to processinformation, generate controlling commands, and the like. In oneembodiment, the VR content analyzer and the VR synthesizer may eachcomprise a processor of the computer system, and in other embodiments asingle processor may be configured to perform the associated functions.The various computer systems, memory, and components of the system 100may be operably coupled to communicate information, and the system mayfurther include various hardware and software communication modules andcircuitry to enable wired or wireless communication of information.

Turning now to FIG. 2, an aspect of the content analyzer 202 of the VRgenerating system 200 is shown. As previously discussed with respect toFIG. 1, the original content DB 101 or other content source may provideoriginal content in the form of traditional 2D video, 2D multiviewcontent, 3D video content, or the like, as discussed. The contentanalyzer 202 may include various components, including objectrecognition 203, audio recognition 204, and text extraction 205components.

The object recognition component 203 may receive video or images of theoriginal content and perform object detection and recognition processesin order to recognize various objects, landmarks, faces, and the like,depicted in the original content. The object recognition information isthen passed to an object analyzer component 208 which generates a querybased on the object recognition information to perform identificationprocesses and retrieve additional related information stored in theobject characteristics database 206, as previously discussed.

As an example, the object recognition component 203 may analyze theoriginal content and detect a face in the original content. The objectanalyzer 208 may query the object characteristics database 206 withmetadata of the detected face, wherein the query incorporates variousmetadata of the particular face. The object analyzer 208 may also beconfigured to query various other information sources to obtainidentifying information of the particular face. In one example, theobject analyzer 208 may query information provided by a third party,such as an image repository connected via a network such as theinternet. In response to the query, the object analyzer 208 may receiveidentifying information of a particular actor corresponding to therecognized face of the original content. The object characteristicsdatabase 206 may also include stored information associated withmetadata of the particular actor, including images or other videocontent depicting the actor's face from various angles.

In such an example, 2D video content stored in the objectcharacteristics database 206 may be associated with metadata of the nameof the actor and may include an actor turning his or her head to oneside in a one-second 2D video clip. The video clip may have been createdfor use in separate unrelated media, or in other embodiments it may havebeen created specifically to populate the object characteristicsdatabase 206. Further in the example, if the one-second video clip isoriginally shot in 48 frames-per-second format, the stored informationmay provide up to 48 individual video frame images of the actor's headand face from varying angles, thereby providing additionalthree-dimensional information for mapping the actor's face and head inthe generation of the VR content.

In an embodiment, the system also includes a VR scene database 212 andVR object database 213. The VR scene database 212 may be configured tostore VR content information related to complete scenes which may beused and incorporated into the generation of new VR content. The VRobjects database 213 may be configured to store VR content informationrelated to specific and identified objects which are extracted and maybe incorporated into the generation of new VR content. The objectanalyzer 208 may also query the VR scene database 212 and the VR objectdatabase 213 for existing VR content which corresponds to the identifiedobjects, people, locations environments, and the like, of the originalcontent. The query to the VR scene database 212 and the VR objectdatabase 213 may include the same or similar metadata information as thequery to the object characteristics database 206 and the soundcharacteristics database 207 as previously discussed.

The audio recognition component 204 may receive audio of the originalcontent and perform audio recognition processes in order to recognizevarious speech, voices, noises, and sounds included in the originalcontent. The audio recognition information is then passed to an audioanalyzer component 209 which generates a query based on the audiorecognition information to retrieve additional related informationstored in a sound characteristics database 207. In some embodiments, theaudio recognition component 204 may be configured to extract specificaudio characteristics and perform recognition using a database ofpreviously collected and stored audio characteristics, or may beconfigured to query third party sources such as an audio clip repositoryconnected via a network, such as the internet, as similarly discussedabove with respect to the object recognition component 203. Using therecognition information, the audio analyzer 209 may query the soundcharacteristics database 207 to identify certain objects, features,conditions, people, and the like, included in the original content. Insome embodiments, the audio analyzer 209 may also be configured to querythe VR scene database 212 and the VR object database 213 for existing VRcontent related to the recognition information, similar to the querypreviously discussed with respect to queries generated by the objectanalyzer 208.

For example, as previously discussed, the distinctive acousticfingerprint of a helicopter rotor sound may be used by the audioanalyzer to match with an acoustic fingerprint or other identifyingaudio information of a helicopter stored in the sound characteristicsdatabase 207. In another example, a voice may be recognized by the audiorecognition component 204, and the audio analyzer may query the soundcharacteristics database 207 to match the recognized voice with voiceinformation stored in the database in order to identify the particularactor or person. Thus, the audio analyzer 209 may also provide theobject analyzer 208 with additional information—by identifying audio ofpeople, objects, and the like—to perform object analysis in order togenerate an accurate metadata query for additional stored informationfrom the object characteristics database 206.

In other embodiments, the audio recognition component 204 may providecontext information for particular objects to be generated in the VRformat. For example, the audio recognition component 204 may detect thatthe audio of the helicopter provided in multi-channel audio indicatesthat the helicopter is flying through a scene in a left to rightdirection. Based on this recognized information, the audio analyzer maygenerate context information for generating the VR scene to compile avirtual representation of its position, speed, and direction within thevirtual space.

Additionally, information of the identified helicopter may be retrievedand included in the generated VR content even where the original contentdoes not include a visual depiction of the helicopter. For example, in ascene of the original content, the rotor sound of a helicopter may beheard just off screen, but no depiction of the actual helicopter may beincluded. Based on the audio information as previously discussed, thehelicopter, as well as its position with respect to the viewer of theoriginal content may be identified. The information provided by theaudio analyzer may be used by the object analyzer to retrieve images orvideo of the identified helicopter stored in the object characteristicsdatabase 206, VR scene database 212, or VR object database 213 to bemapped to the audio of the original content and used in generating 360degree VR content which includes a view of the helicopter when the VRviewer is turned in the identified direction of the helicopter. In someembodiments, the information obtained from the audio recognitioncomponent 204 may provide environmental information in addition to thecontext information for particular scenes to be generated in the VRformat by providing a full 360 degree VR experience to a viewer. Forexample, the audio recognition component 204 may detect soundsrecognized as crashing waves along a shore. Based on the recognizedaudio information, the audio analyzer 209 may provide information to theobject analyzer 208 and the context analyzer 211 that the VR scene to begenerated involves a coast or beach environment. The information fromthe audio recognition component 204 may be implemented to inform variousother environmental conditions, such as flashes, rain, wind, traffic,indoors, outdoors, nighttime, daytime, morning, and the like. Thecontext analyzer 211 may use the information related to a coast or beachenvironment to retrieve existing information stored in the objectcharacteristics database 206, sound characteristics database 207, VRscene database 212, or VR object database 213 associated with a coast orbeach environment to provide the VR synthesizer with environmentinformation to generate the VR content.

Therefore, the context analyzer may retrieve additional stored contentindicated by objects, environments, settings, and the like which areprovided by audio, video, and text of the original content in order toretrieve additional stored content to complete the 360 degree VRcontent, where the original content, such as traditional 2D originalcontent for example, does not include such information.

Referring back to the example of FIG. 4, the context analyzer 211 mayobtain information of the Statue of Liberty 401 depicted in the originalcontent 400. Based on this information, the context analyzer 211 mayidentify that the scene context is based in New York and may retrieveadditional stored information associated with New York metadata, as wellas specific image, sound, and object information stored in the objectcharacteristics database 206, sound characteristics database 207, VRscene database 212, or VR object database 213 to retrieve theinformation needed to generate a full 360 degree view of the Statue ofLiberty area indicated by the original content 400. The retrievedinformation may include environmental or object information of theStatue of Liberty area that was created and used for other mediacontent, or that was created specifically to collect video and audioinformation for later use by the VR generator. Thus, the contextanalyzer 211 retrieves information not included in the original content400 in order to identify sufficient information needed to generate VRcontent capable of providing a 360 degree view to a viewer based on theoriginal content, although the original content may only includeinformation for a single viewing angle in the environment.

Additionally, a text extraction component 205 may be included in thecontent analyzer 202 to extract text information from the originalcontent. Extracted text from the original content may include subtitles,closed captioning content, actual text depicted within the originalcontent, and the like. The extracted text may be provided to the textanalyzer 210. Additionally, the text analyzer 210 may receive textinformation from the audio recognition component 204 resulting fromspeech-to-text recognition performed by the audio recognition component.The text analyzer 210 may process the extracted text information toprovide additional object, context, or environmental information similarto that as previously discussed with respect to the audio analyzer 209.

For example, if extracted text information indicates that a particularscene of the original content occurs in a restaurant (i.e.,speech-to-text information is received of characters ordering from amenu), the text analyzer 210 may provide environment context informationto the context analyzer 211 to inform that the VR scene is to begenerated within a restaurant setting. In some embodiments, the textanalyzer may also provide the environment context information to theaudio analyzer 209 and the object analyzer 208. This environment contextinformation may contribute to improving efficiency when generatinginformed queries to the object characteristics database 206 and thesound characteristics database 207.

Additionally, the context analyzer 211 may receive the informationgenerated and retrieved by the object analyzer 208, audio analyzer 209,and the text analyzer 210 to generate additional features of thegenerated VR content based on the environment and context of theoriginal source content. For example, the context analyzer 211 maydetermine that a particular scene occurs in a beach setting, and mayquery the object characteristics database 206, sound characteristicsdatabase 207, VR scene database 212, and VR object database 213 toretrieve additional information to generate a complete scene in thegenerated VR content. Thus, in addition to images and video to generatethe visual aspects of the VR content, the context analyzer may alsoretrieve audio information such as ambient sounds such as backgroundseagull sounds, a sound of crashing waves along a shore, a backgroundsound of a bell being rung from a nearby boat, and the like. The contextimages, video, audio, and other information may thus be retrieved andoutput by the context analyzer 211 for generation of the VR content.

Various embodiments of the system 200 may further comprise additionalaspects which have been omitted for the purposes of the discussion ofFIG. 2, and it will be understood by those of ordinary skill in the artthat all combinations of aspects of the system are considered includinga VR features database, VR synthesizer, and the like, as discussed withrespect to the examples of FIG. 1. Additionally, it will be understoodthat this disclosure considers embodiments of a system for generating VRcontent wherein the system includes all of the components depicted anddiscussed herein, including a system having hardware or softwarecomponents implemented together, or separated and connected by knownnetworking and communication methods and techniques.

As previously discussed, in some embodiments the content analyzer 202 ofthe system 200 may comprise one or more software or hardware computersystems and may further comprise or be operably coupled to one or morehardware memory systems for storing information including databases forstoring, accessing, and querying various content and characteristicsinformation. In hardware implementations, the one or more computersystems incorporate one or more computer processors and controllers.

The components of the various embodiments discussed herein may eachcomprise a hardware processor of the one or more computer systems, andin one embodiment a single processor may be configured to implement thevarious components. For example, in one embodiment, the object analyzer208, audio analyzer 209, and text analyzer 210 may be implemented asseparate hardware systems, or may be implemented as a single hardwaresystem. The hardware system may include various transitory andnon-transitory memory for storing information, wired and wirelesscommunication receivers and transmitters, displays, and input and outputinterfaces and devices. The various computer systems, memory, andcomponents of the system 200 may be operably coupled to communicateinformation, and the system may further include various hardware andsoftware communication modules, interfaces, and circuitry to enablewired or wireless communication of information.

Turning now to FIG. 3, the figure shows a flowchart including operationsof a method 300 of an embodiment of the present disclosure forgenerating VR content. The method 300 includes an operation block at 301of obtaining source information from the original media content,including the object recognition information, audio recognitioninformation, and extracted text information by the content analyzer asdiscussed above with reference to FIG. 2. The source information mayalso include metadata of the identified objects, people, landmarks,environment conditions, and the like, analyzed by the object analyzer,audio analyzer, and text analyzer as discussed above with respect to theexample of FIG. 2.

The method 300 further includes retrieving characteristic informationrelated to the original media content content at block 302. Thisoperation may include querying the characteristics database, or separateobject characteristics database and sound characteristics database asdiscussed above with respect to the example of FIG. 2. The retrievedcharacteristic information may include additional stored informationsuch as video or images of a recognized object, person, landmark, andthe like included in the original media content. In one example, theretrieved characteristic information may include a video clip showingthe Statue of Liberty from multiple viewing angles, or a number ofimages of the Statue of Liberty from multiple viewing angles.

In another example, the retrieved characteristic information may includeidentifying information of a recognized object, such as modelinformation of a helicopter identified using an acoustic fingerprint ofthe rotor audio obtained from the original media content. In suchexamples, the retrieved characteristic information is related to theoriginal media content and includes additional stored information of theoriginal media content to enable generation of VR content.

The method 300 further includes an operation of generating VR contentbased on the obtained source information and the retrievedcharacteristic information at block 303. Generating the VR content mayinclude 3D mapping of an object or a face included in the original mediacontent content based on the retrieved characteristic information, wherethe characteristic information provides additional stored information ofthe object or face of varying viewpoints, thereby providing sufficientinformation for 3D mapping.

Generating the VR content may also include incorporating detectedinformation of the original media content, such as determining theposition of a particular person or object within the generated VR spacebased on directional multi-channel audio information associated with theperson or object, or based on text information indicating a relativeposition of the person or object.

In another embodiment, the method may further include retrieving stored3D information based on the source information. For example, the stored3D information may include previously generated or created 3D or VRcontent of a recognized object or person included in the original mediacontent. In such an embodiment, the operation of generating the VRcontent shown at block 303 may include generating the VR content basedon the stored 3D information, the source information, and the retrievedcharacteristic information. The method 300 may also include outputtingthe generated VR content to be reproduced to a viewer (not depicted).The original content may also be output in addition to the generated VRcontent to be reproduced to a viewer.

According to embodiments of the present invention, a system forgenerating VR content may include one or more computer systems, computerprocessors, storage devices, distribution channels, etc., in order toanalyze the original content, retrieve related characteristicinformation, and generate the VR content based on the original contentand the retrieved characteristic information. Portions of the originalcontent, object characteristic information, audio characteristicinformation, and VR features information may reside in differentlocations, or may be distributed widely and coordinated with oneanother, particularly in a large organization. Moreover, particularportions may reside in entirely separate environments, such as contentcreators, content distributors, and the like.

Various embodiments described herein may be implemented in acomputer-readable non-transitory medium, a machine-readable medium, orsimilar medium using, for example, software, hardware, or anycombination thereof. For a hardware implementation, the embodimentsdescribed herein may be implemented within one or more applicationspecific integrated circuits (ASICs), digital signal processors (DSPs),digital signal processing devices (DSPDs), programmable logic devices(PLDs), field programmable gate arrays (FPGAs), processors, controllers,micro-controllers, microprocessors, other electronic units designed toperform the functions described herein, or a selective combinationthereof. In some cases, such embodiments are implemented by a controlleror a processor. In all embodiments, the various components describedherein may be implemented as a single component, or alternatively may beimplemented in various separate components.

The various devices, modules, terminals, and the like discussed hereinmay be implemented on a computer by execution of software comprisingmachine instructions read from computer-readable medium. These hardwareaspects may further include an input/output interface, graphical userinterface, or other input/output means to allow the input/output ofinformation related to controlling functions, inputting information,outputting information, receiving information, or transmittinginformation. In certain embodiments, several hardware aspects may beimplemented using a single computer, in other embodiments multiplecomputers, input/output systems and hardware may be used to implementthe system.

For a software implementation, certain embodiments described herein maybe implemented with separate software modules, such as procedures andfunctions, each of which perform one or more of the functions andoperations described herein. The software codes can be implemented witha software application written in any suitable programming language andmay be stored in memory and executed by a controller or processor.

The foregoing disclosed embodiments and features are merely exemplaryand are not to be construed as limiting the present invention. Thepresent teachings can be readily applied to other types of apparatusesand processes. The description of such embodiments is intended to beillustrative, and not to limit the scope of the claims. Manyalternatives, modifications, and variations will be apparent to thoseskilled in the art.

What is claimed is:
 1. A method for generating virtual reality (VR)content, the method comprising: identifying an element in original mediacontent; generating a VR representation of the identified element basedon one or more images of the identified element such that the generatedVR representation includes visual 3D information of the identifiedelement; generating VR content corresponding to the original mediacontent wherein the VR content includes the VR representation of theidentified element, wherein the generating the VR content comprisesdetermining a position of the VR representation within the VR contentbased on directional audio information associated with the identifiedelement from the original media content; and outputting the VR contentcorresponding to the original media content.
 2. The method of claim 1,further comprising determining a movement of the VR representation ofthe identified element to be depicted within the VR content based on thedirectional audio information, wherein outputting the VR contentcomprises displaying the movement of the VR representation within the VRcontent based on the directional audio information.
 3. The method ofclaim 2, further comprising determining a speed of the movement of theVR representation of the identified element to be depicted within the VRcontent based on the directional audio information, wherein outputtingthe VR content comprises displaying the speed of the movement of the VRrepresentation within the VR content based on the directional audioinformation.
 4. The method of claim 1, further comprising searching forimages of the identified element using metadata of the identifiedelement and receiving the one or more images of the identified elementin response to the search.
 5. The method of claim 1, wherein at leastone image of the one or more images is mapped to at least one viewingangle of the visual 3D information of the identified element.
 6. Themethod of claim 1, wherein the identified element is identified bymatching an image of a frame of video of the original media content toimage information associated with the identified element.
 7. The methodof claim 1, wherein the identified element is identified by matchingaudio from the original media content to audio information associatedwith the identified element.
 8. The method of claim 7, wherein theidentified element is not visually depicted in the original mediacontent.
 9. The method of claim 1, wherein the identified elementcorresponds to a particular person in the original media content. 10.The method of claim 9, wherein the particular person is identified bymatching voice audio from the original media content to voiceinformation associated with the particular person.
 11. The method ofclaim 9, wherein the particular person is identified by matching animage of a face from the original media content with facial recognitioninformation associated with the particular person.
 12. The method ofclaim 1, wherein the VR content comprises at least one of 360 degreeimagery of a scene or 3D content.
 13. A virtual reality (VR) contentgenerating system comprising: at least one controller configured to:identify an element in original media content; generate a VRrepresentation of the identified element based on one or more images ofthe identified element such that the generated VR representationincludes visual 3D information of the identified element; generate VRcontent corresponding to the original media content wherein the VRcontent includes the VR representation of the identified element,wherein generating the VR content comprises determining a position ofthe VR representation within the VR content based on directional audioinformation associated with the identified element from the originalmedia content; and output the VR content corresponding to the originalmedia content.
 14. The system of claim 13 further comprising a VRdisplay wherein the at least one controller is further configured to:determine a movement of the VR representation of the identified elementto be depicted within the VR content based on the directional audioinformation; and output the VR content by causing the VR display todisplay the movement of the VR representation within the VR contentbased on the directional audio information.
 15. The system of claim 14,wherein the at least one controller is further configured to: determinea speed of the movement of the VR representation of the identifiedelement to be depicted within the VR content based on the directionalaudio information; and output the VR content by causing the VR displayto display the speed of the movement of the VR representation within theVR content based on the directional audio information.
 16. The system ofclaim 13, wherein the at least one controller is further configured tosearch for images of the identified element using metadata of theidentified element and receive via a communication interface the one ormore images of the identified element in response to the search.
 17. Thesystem of claim 13, wherein the at least one controller is furtherconfigured to map at least one image of the one or more images to atleast one viewing angle of the visual 3D information of the identifiedelement.
 18. The system of claim 13, wherein the at least one controlleris further configured to identify the identified element by matching animage of a frame of video of the original media content to imageinformation associated with the identified element.
 19. The system ofclaim 13, wherein the at least one controller is further configured toidentify the identified element by matching audio from the originalmedia content to audio information associated with the identifiedelement, and wherein the identified element is not visually depicted inthe original media content.
 20. The system of claim 13, wherein theidentified element corresponds to a particular person in the originalmedia content, and wherein the at least one controller is furtherconfigured to: identify the particular person by matching voice audiofrom the original media content to voice information associated with theparticular person; or identify the particular person by matching animage of a face from the original media content with facial recognitioninformation associated with the particular person.
 21. The system ofclaim 13, wherein the VR content comprises at least one of 360 degreeimagery of a scene or 3D content.
 22. A machine-readable non-transitorymedium having stored thereon machine-executable instructions for:identifying an element in original media content; generating a VRrepresentation of the identified element based on one or more images ofthe identified element such that the generated VR representationincludes visual 3D information of the identified element; generating VRcontent corresponding to the original media content wherein the VRcontent includes the VR representation of the identified element,wherein the generating the VR content comprises determining a positionof the VR representation within the VR content based on directionalaudio information associated with the identified element from theoriginal media content; and outputting the VR content corresponding tothe original media content.